Clustering with Minimum Spanning Trees
Abstract
We propose two clustering algorithms based on the Euclidean minimum spanning tree: one k-constrained and the other unconstrained. Our k-constrained clustering algorithm produces a k-partition of a set of points for any given k. The algorithm constructs a minimum spanning tree of a set of representative points and removes edges that satisfy a predefined criterion. The process is repeated until k clusters are produced. Our unconstrained clustering algorithm partitions a point set into a group of clusters by maximally reducing the overall standard deviation of the edges in the Euclidean minimum spanning tree constructed from a given point set, without prescribing the number of clusters. We present our experimental results comparing our proposed algorithms with k-means and the Expectation-Maximization (EM) algorithm on both artificial data and benchmark data from the UCI repository. We also apply our algorithms to image color clustering and compare them with the standard minimum spanning tree clustering algorithm.
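For orientation, the standard minimum spanning tree clustering baseline mentioned at the end of the abstract is commonly described as: build the Euclidean minimum spanning tree, then cut its k-1 longest edges so that k connected components remain. The sketch below illustrates only that baseline under this assumption. It is not the k-constrained or unconstrained algorithm proposed in the paper, whose edge-removal criteria are not given here, and the function names `euclidean_mst` and `mst_clusters` are hypothetical.

```python
import math


def euclidean_mst(points):
    """Prim's algorithm on the complete Euclidean graph.

    Returns the tree as a list of (length, u, v) edges. Runs in O(n^2),
    which is acceptable for a small illustrative example.
    """
    n = len(points)
    if n == 0:
        return []
    in_tree = [False] * n
    best = [math.inf] * n   # shortest known distance from the tree to each point
    parent = [-1] * n       # tree endpoint that realises that distance
    best[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the point outside the tree that is closest to it.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((best[u], parent[u], u))
        # Relax distances from the newly added point.
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best[v]:
                    best[v] = d
                    parent[v] = u
    return edges


def mst_clusters(points, k):
    """Cut the k-1 longest MST edges and label the remaining components."""
    edges = sorted(euclidean_mst(points))
    kept = edges[: max(len(edges) - (k - 1), 0)]
    root = list(range(len(points)))

    def find(i):
        # Union-find lookup with path halving.
        while root[i] != i:
            root[i] = root[root[i]]
            i = root[i]
        return i

    for _, u, v in kept:
        root[find(u)] = find(v)
    # Map each component representative to a compact label 0, 1, 2, ...
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(len(points))]


# Two well-separated groups of three points each (made-up data).
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
labels = mst_clusters(points, k=2)
print(labels)  # [0, 0, 0, 1, 1, 1]
```

With k=2, the single long edge joining the two groups is the one removed, so each group receives its own label. The quadratic-time Prim construction is chosen here only to keep the example dependency-free.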
Similar resources
Performance of Improved Minimum Spanning Tree Based on Clustering Technique
Clustering is one of the most important and basic tools for data mining. Because they can detect clusters with irregular boundaries, minimum spanning tree-based clustering algorithms have been widely used in practice. In such clustering algorithms, the search for nearest objects in the construction of minimum spanning trees is the main source of computation
A Novel Algorithm for Meta Similarity Clusters Using Minimum Spanning Tree
The minimum spanning tree clustering algorithm is capable of detecting clusters with irregular boundaries. In this paper we propose two minimum spanning tree based clustering algorithms. The first algorithm produces k clusters with centers and guaranteed intra-cluster similarity. The second algorithm is proposed to create a dendrogram using the k clusters as objects with guaranteed inter-cluster...
Reducing Runtime Values in Minimum Spanning Tree Based Clustering by Visual Access Tendency
Clustering has been widely used in data analysis. Dissimilarity assesses the distance between objects, which is important in Minimum Spanning Tree (MST) based clustering. In MST based clustering, an inconsistent edge is identified and removed without prior knowledge of clustering tendency, and the resulting clusters take the form of sub-trees. Clustering validity is to be checked at every iterat...
An Efficient Parallel Data Clustering Algorithm Using Isoperimetric Number of Trees
We propose a parallel graph-based data clustering algorithm using CUDA GPU, based on exact clustering of the minimum spanning tree in terms of a minimum isoperimetric criterion. We also provide a comparative performance analysis of our algorithm with other related ones which demonstrates the general superiority of this parallel algorithm over other competing algorithms in terms of accuracy and s...
LC Note: LC-TOOL-2004-020 arXiv:physics/0409039 CALORIMETER CLUSTERING WITH MINIMAL SPANNING TREES
We present a top-down approach to calorimeter clustering. An algorithm based on minimal spanning tree theory is described briefly.
Journal:
- International Journal on Artificial Intelligence Tools
Volume 20
Publication date: 2011